Abstract

In Part II of this two-part paper, a sum-rate-maximizing power allocation with minimum power consumption is
found for multiple-input multiple-output (MIMO) decode-and-forward (DF) two-way relaying (TWR) in a network
optimization scenario. In this scenario, the relay and the source nodes jointly optimize their power allocation strategies
to achieve network optimality. Unlike the relay optimization scenario considered in Part I, which features low
complexity but does not achieve network optimality, the network-level optimal power allocation can be achieved in
the network optimization scenario at the cost of higher complexity. The network optimization problem is considered
in two cases, each with several subcases. It is shown that the considered problem, which is originally nonconvex,
can be transferred into different convex problems for all but two subcases. For the remaining two subcases, one for
each case, it is proved that the optimal strategies for the source nodes and the relay must satisfy certain properties.
Based on these properties, an algorithm is proposed for finding the optimal solution. The effect of asymmetry in
the number of antennas, power limits, and channel statistics is also considered. Such asymmetry is shown to have a
negative effect on both the achievable sum-rate and the power allocation efficiency in MIMO DF TWR. Simulation
results demonstrate the performance of the proposed algorithm and the effect of asymmetry in the system.
I. Introduction

Two-way relaying (TWR) is a promising protocol featuring high spectral efficiency [1]. Optimizing transmit
strategies such as power allocation of the participating nodes in a TWR helps to maximize the spectral efficiency
in terms of sum-rate [2]-[5]. As shown in Part I of this two-part paper [2], achieving the maximum sum-rate

in TWR, however, does not necessarily demand the consumption of all the available power at all participating 
nodes. As a result, it is of interest to find the power allocation which minimizes the power consumption of the 
participating nodes among all power allocations that achieve the maximum sum-rate in TWR. For brevity, this 
objective of optimizing the power allocation at the participating nodes is called the sum-rate maximization with 
minimum power consumption. In Part I of this two-part paper, the problem of relay optimization for multiple-input 
multiple-output (MIMO) decode-and-forward (DF) TWR is investigated, in which the relay optimizes its own power
allocation to achieve sum-rate maximization with minimum power consumption given the power allocation of the 
source nodes. The solution of the relay optimization problem derived in Part I gives the optimal power allocation of 
the relay in a MIMO DF TWR system in the case when there is no coordination between the relay and the source 
nodes. Although this power allocation is in general sub-optimal on the network level, it is a viable and preferable 
solution for power allocation when the considered MIMO DF TWR system has limitation on the computational 
capability of finding the power allocation strategy. If the participating nodes have sufficient computational capability, 
a better performance than that in the relay optimization scenario can be achieved. In such a case, the relay and 
the source nodes can jointly optimize their power allocation strategies for sum-rate maximization with minimum 
power consumption. 

Joint optimization of transmit strategies of the relay and source nodes for MIMO TWR has been studied in
[4]-[7]. Transmit strategies for maximizing the weighted sum-rate of a TWR system are studied in [4], in which
the optimal solution is found through alternating optimization over the transmit strategies of the relay and source
nodes. In [5], a low-complexity sub-optimal design of relay and source node transmit strategies is derived for either
sum-rate maximization or power consumption minimization under quality-of-service requirements. The joint source
node and relay precoding design for minimizing the mean-square-error in a MIMO TWR system is studied in [6].
The optimal solution is found through an alternating optimization of several sub-problems obtained from the original
non-convex problem. The authors in [7] solve the robust joint source and relay optimization problem for a MIMO
TWR system with imperfect channel state information. Deriving the optimal solution for the joint optimization
problem generally requires alternating optimization over the transmit strategies of the relay and the source nodes,
which leads to high complexity [4], [6], [7]. All the above works consider MIMO amplify-and-forward (AF) TWR.

Considering the fact that DF TWR may achieve better performance than AF TWR, especially at low signal-to-noise
ratio (SNR) [8], and the fact that DF TWR has the flexibility of performing separate power allocation/precoding
for relaying the communication in each direction, it is of interest to study the problem of joint optimization over
the power allocation strategies of the relay and the source nodes for MIMO DF TWR. If we further consider the 
power efficiency, the problem becomes more complicated. Part II of this two-part paper studies the problem of 
sum-rate maximization with minimum power consumption for MIMO DF TWR when the relay and the source 
nodes jointly optimize their power allocations. This scenario is referred to as network optimization scenario. The 
objective of this part is to find the joint optimal power allocation of the relay and the source nodes while reducing 
the complexity of finding the optimal solution. The contributions of Part II are as follows. 

First, we show that the considered network optimization problem is nonconvex. Based on the comparison of the 
maximum achievable sum-rates of the multiple-access (MA) and broadcasting (BC) phases, the network optimization 
problem is considered for the case that the maximum achievable sum-rate of the MA phase is larger than or equal to
that of the BC phase and the case that the maximum achievable sum-rate of the MA phase is less than that of the BC 
phase, respectively. In each case, we show that the original problem can be transferred, under certain conditions, into 
equivalent convex problem(s) which can be solved with low complexity. Accordingly, the above two cases are further 
analyzed in terms of subcases. For the subcases in which the original problem can be transferred into equivalent
convex problems, the problem of sum-rate maximization and the problem of power consumption minimization are 
decoupled so that the sum-rate in one of the MA or BC phase is maximized while the power consumption in the 
other phase is minimized. The complexity of finding the optimal solution of the network optimization problem in 
the above subcases is therefore low. 

Second, for the remaining two subcases in which the original problem cannot be transferred to a convex form, 
we prove properties that the optimal solution must satisfy. Based on these properties, we propose algorithms for 
finding the joint optimal power allocations for the relay and the source nodes. While the proposed algorithms find 
the optimal solution in iterations, the optimization problems that the relay and source nodes need to solve in each
iteration are convex and simple. As a result, the complexity of the proposed algorithms for finding the optimal 
solution of the nonconvex joint optimization problem is acceptable in these two subcases. 

Third, we demonstrate the effect of asymmetry on MIMO DF TWR in the network optimization scenario. Similar 
to the relay optimization scenario, we show that asymmetry in power limits, number of antennas, and channel 
statistics can lead to performance degradation in both the achievable sum-rate and the power allocation efficiency. 
Specifically, we show that the optimal power allocation in both of the aforementioned two subcases in which the 
original problem cannot be transferred to a convex problem is not as efficient as that in other subcases. Then, it is 
shown through analysis and simulation that the asymmetry in the power limits, number of antennas, and channel 
statistics leads to a larger occurrence probability of the above-mentioned two subcases. As a result, we show that 
asymmetry leads to performance degradation in the MIMO DF TWR system. 

The rest of the paper is organized as follows. Section II gives the system model of this work. The network
optimization problem is studied in Section III. Simulation results are shown in Section IV and Section V concludes
the paper. Section VI "Appendix" provides proofs for the lemmas and theorems.

II. System Model 

A TWR with two source nodes and one relay is considered, where source node $i$ ($i = 1, 2$) and the relay have
$n_i$ and $n_r$ antennas, respectively. The information symbol vector and the precoding matrix of source node $i$ are
denoted as $\mathbf{s}_i$ and $\mathbf{W}_i$, respectively, where $\mathbf{s}_i$ is a complex Gaussian vector with $E\{\mathbf{s}_i\} = \mathbf{0}$, $E\{\mathbf{s}_i \mathbf{s}_i^H\} = \mathbf{I}$, and
$E\{\mathbf{s}_i \mathbf{s}_j^H\} = \mathbf{0}$, in which the superscript $(\cdot)^H$ stands for the conjugate transpose and $\mathbf{I}$ denotes the identity matrix.¹
The channels from source node $i$ to the relay and from the relay to source node $i$ are denoted as $\mathbf{H}_{ir}$ and $\mathbf{H}_{ri}$,
respectively. It is assumed that source node $i$ knows $\mathbf{H}_{ri}$ and the relay knows $\mathbf{H}_{ir}, \forall i$. It is also assumed that the
relay knows $\mathbf{H}_{ri}, \forall i$ by using either channel reciprocity or channel feedback. For example, if the system works
in the time-division duplex mode, $\mathbf{H}_{ri}, \forall i$ are known at the relay due to channel reciprocity. Otherwise, when the
system works in the frequency-division duplex mode, the relay needs feedback from the source nodes to obtain
$\mathbf{H}_{ri}, \forall i$.

¹It is assumed as default throughout the paper that the user indices $i$ and $j$ satisfy $i \neq j$.
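In the MA phase, source node $i$ transmits the signal $\mathbf{W}_i \mathbf{s}_i$ to the relay. The sum-rate of the MA phase, denoted
as $R^{\mathrm{ma}}(\mathbf{D})$, is bounded by [10]

$$R^{\mathrm{ma}}(\mathbf{D}) = \log \left| \mathbf{I} + \left( \mathbf{H}_{1r} \mathbf{D}_1 \mathbf{H}_{1r}^H + \mathbf{H}_{2r} \mathbf{D}_2 \mathbf{H}_{2r}^H \right) (\sigma_r^2)^{-1} \right| \tag{1}$$

where $\mathbf{D}_i = \mathbf{W}_i \mathbf{W}_i^H, \forall i$, $\mathbf{D} = [\mathbf{D}_1, \mathbf{D}_2]$, and $\sigma_r^2 \mathbf{I}$ is the noise covariance matrix at the relay.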

The relay decodes Si and S2 from the received signal, performs precoding for each of them, and then forwards the 
superposition of the precoded information symbols to the source nodes in the BC phase. Note that the Exclusive-OR 
(XOR) based network coding is adopted at the relay in some works (for example [11]). While XOR based network
coding may achieve better performance in terms of sum-rate than the symbol-level superposition, it relies largely on
the symmetry of the traffic from the two source nodes. The asymmetry in the traffic in the two directions can lead
to significant degradation in the performance of XOR in TWR [12], [13]. As the general case of TWR is considered
and there is no guarantee of traffic symmetry, the simple approach of symbol-level superposition is assumed here 
at the relay as it is considered in ([T]. 

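With the receiver side channel knowledge and the knowledge of the relay precoder, each source node is able to
subtract its self-interference from the received signal. Denote $\mathbf{T}_{ri}$ as the relay precoding matrix for relaying the
signal from source node $j$ to source node $i$. Let $\mathbf{B}_i = \mathbf{T}_{ri} \mathbf{T}_{ri}^H, \forall i$ and $\mathbf{B} = [\mathbf{B}_1, \mathbf{B}_2]$. Then the information rate for
the communication from the relay to source node $i$, denoted as $R_{ri}(\mathbf{B}_i)$, is expressed as

$$R_{ri}(\mathbf{B}_i) = \log \left| \mathbf{I} + \left( \mathbf{H}_{ri} \mathbf{B}_i \mathbf{H}_{ri}^H \right) (\sigma_i^2)^{-1} \right| \tag{2}$$

where $\sigma_i^2 \mathbf{I}$ is the noise covariance matrix at source node $i$. The sum-rate of the BC phase, denoted as $R^{\mathrm{bc}}(\mathbf{B})$, is

$$R^{\mathrm{bc}}(\mathbf{B}) = R_{r1}(\mathbf{B}_1) + R_{r2}(\mathbf{B}_2). \tag{3}$$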



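The end-to-end information rate from source node $j$ to source node $i$, denoted as $R_{ji}(\mathbf{B}_i, \mathbf{D}_j)$, is bounded by

$$R_{ji}(\mathbf{B}_i, \mathbf{D}_j) = \frac{1}{2} \min\{ R_{ri}(\mathbf{B}_i), R_{jr}(\mathbf{D}_j) \} \tag{4}$$

where

$$R_{jr}(\mathbf{D}_j) = \log \left| \mathbf{I} + \left( \mathbf{H}_{jr} \mathbf{D}_j \mathbf{H}_{jr}^H \right) (\sigma_r^2)^{-1} \right|. \tag{5}$$

Then the sum-rate for communication over both MA and BC phases for the considered DF TWR can be written
as

$$R^{\mathrm{tw}}(\mathbf{B}, \mathbf{D}) = \frac{1}{2} \min\{ R^{\mathrm{ma}}(\mathbf{D}), R(\mathbf{B}, \mathbf{D}) \} \tag{6}$$

where

$$R(\mathbf{B}, \mathbf{D}) = \min\{ R_{r1}(\mathbf{B}_1), R_{2r}(\mathbf{D}_2) \} + \min\{ R_{r2}(\mathbf{B}_2), R_{1r}(\mathbf{D}_1) \}. \tag{7}$$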
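As a small numerical illustration of (6) and (7), the following Python sketch evaluates the two-way sum-rate from phase rates that are assumed to be already computed as scalars. The function name `two_way_sum_rate` and the example numbers are illustrative assumptions and do not come from the paper.

```python
def two_way_sum_rate(r_ma, r_r1, r_r2, r_1r, r_2r):
    """Evaluate (6)-(7) from scalar rates given in the same log units."""
    # R(B, D) in (7): each direction is limited by the weaker of its two hops.
    # Direction 2 -> 1 uses R_2r (MA hop) and R_r1 (BC hop), and vice versa.
    r_bd = min(r_r1, r_2r) + min(r_r2, r_1r)
    # R^tw(B, D) in (6): the MA sum-rate also caps the total; the factor 1/2
    # accounts for the two transmission phases.
    return 0.5 * min(r_ma, r_bd)


# Example: the MA sum-rate (5.0) is the bottleneck, since R(B, D) = 2.5 + 3.5.
print(two_way_sum_rate(r_ma=5.0, r_r1=3.0, r_r2=4.0, r_1r=3.5, r_2r=2.5))
```

In this example the relay-to-node-1 rate (3.0) exceeds the node-2-to-relay rate (2.5), so part of the relay power spent on that direction does not contribute to the sum-rate.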

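Denote the singular value decomposition (SVD) of $\mathbf{H}_{ri}$ as $\mathbf{H}_{ri} = \mathbf{U}_{ri} \mathbf{\Omega}_{ri} \mathbf{V}_{ri}^H$. We assume that the first $r_{ri}$
($r_{ri} \le \min(n_i, n_r)$) diagonal elements of $\mathbf{\Omega}_{ri}$, denoted as $\omega_{ri}(1), \ldots, \omega_{ri}(r_{ri})$, are non-zero. Since the source nodes
can subtract their self-interference in the BC phase and the relay has channel knowledge of $\mathbf{H}_{ri}, \forall i$, the power
allocation of the relay for relaying the signal in either direction should be based on waterfilling regardless of how
the relay distributes its power between relaying the signals in the two directions. The actual water-level used by
the relay for relaying the signal from source node $j$ to source node $i$ is denoted as $1/\lambda_i, \forall i$. With water-level $1/\lambda_i$,
$\mathbf{B}_i$ can be given as $\mathbf{B}_i = \mathbf{V}_{ri} \mathbf{P}_{ri}(\lambda_i) \mathbf{V}_{ri}^H$ where

$$\mathbf{P}_{ri}(\lambda_i) = \mathrm{diag}\left( \left( \frac{1}{\lambda_i} - \frac{1}{\alpha_i(1)} \right)^+, \ldots, \left( \frac{1}{\lambda_i} - \frac{1}{\alpha_i(r_{ri})} \right)^+, 0, \ldots, 0 \right)$$

in which $\mathrm{diag}(\cdot)$ stands for making a diagonal matrix using the given elements, $(\cdot)^+$ stands for projection to the
positive orthant, $\alpha_i(k) = |\omega_{ri}(k)|^2 / \sigma_i^2$, and there are $(n_r - r_{ri})$ zeros on the main diagonal of $\mathbf{P}_{ri}(\lambda_i)$.² It holds
that

$$R_{r1}(\mathbf{B}_1) = \sum_{k \in \mathcal{I}_1} \log\left( 1 + \left( \frac{1}{\lambda_1} \alpha_1(k) - 1 \right)^+ \right) \tag{8a}$$

$$R_{r2}(\mathbf{B}_2) = \sum_{k \in \mathcal{I}_2} \log\left( 1 + \left( \frac{1}{\lambda_2} \alpha_2(k) - 1 \right)^+ \right) \tag{8b}$$

where $\mathcal{I}_i = \{1, \ldots, r_{ri}\}$. Therefore, the rate $R_{ri}(\mathbf{B}_i)$ obtained using water-level $1/\lambda_i$ is alternatively denoted as $R_{ri}(\lambda_i)$.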
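The waterfilling expressions for the relay can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions: the effective gains $\alpha_i(k)$ are passed as a plain list of positive floats, the natural logarithm is used (the base of the logarithm is not fixed here), and the function names are hypothetical.

```python
import math


def waterfill_powers(alphas, lam):
    """Diagonal entries (1/lam - 1/alpha)^+ of the waterfilling power matrix."""
    return [max(1.0 / lam - 1.0 / a, 0.0) for a in alphas]


def waterfill_rate(alphas, lam):
    """Rate sum_k log(1 + (alpha_k / lam - 1)^+) as in (8a) and (8b)."""
    return sum(math.log(1.0 + max(a / lam - 1.0, 0.0)) for a in alphas)


alphas = [4.0, 1.0]   # illustrative effective gains alpha_i(k)
lam = 0.5             # water-level 1/lam = 2
print(waterfill_powers(alphas, lam))   # per-eigenmode powers
print(waterfill_rate(alphas, lam))     # rate in nats
```

Eigenmodes whose inverse gain $1/\alpha_i(k)$ lies above the water-level receive zero power and contribute zero rate, which is what the $(\cdot)^+$ operation encodes.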

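From equation (6), it can be seen that the maximization of the sum-rate using minimum power potentially involves
balancing between $R^{\mathrm{ma}}(\mathbf{D})$ and $R(\mathbf{B}, \mathbf{D})$ and between $R_{ri}(\mathbf{B}_i)$ and $R_{jr}(\mathbf{D}_j), \forall i$. However, it is not explicit how
such rate balancing affects the power allocation of the relay and the source nodes. In order to adjust the above
rates through power allocation, we introduce the relative water-levels. Same as in Part I, $1/\mu_1(\mathbf{D}_1)$, $1/\mu_2(\mathbf{D}_2)$, and
$1/\mu_{\mathrm{ma}}(\mathbf{D})$ are defined as

$$\sum_{k \in \mathcal{I}_2} \log\left( 1 + \left( \frac{1}{\mu_1(\mathbf{D}_1)} \alpha_2(k) - 1 \right)^+ \right) = R_{1r}(\mathbf{D}_1) \tag{9a}$$

$$\sum_{k \in \mathcal{I}_1} \log\left( 1 + \left( \frac{1}{\mu_2(\mathbf{D}_2)} \alpha_1(k) - 1 \right)^+ \right) = R_{2r}(\mathbf{D}_2) \tag{9b}$$

$$\sum_{i} \sum_{k \in \mathcal{I}_i} \log\left( 1 + \left( \frac{1}{\mu_{\mathrm{ma}}(\mathbf{D})} \alpha_i(k) - 1 \right)^+ \right) = R^{\mathrm{ma}}(\mathbf{D}). \tag{9c}$$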

Given the above definition, if waterfilling is performed on $\omega_{rj}(k)$'s, $\forall k \in \mathcal{I}_j$ using the water-level $1/\mu_i(\mathbf{D}_i)$,
then the information rate of the transmission from the relay to source node $j$ using the resulting power allocation
achieves precisely $R_{ir}(\mathbf{D}_i)$. If waterfilling is performed on $\omega_{ri}(k)$'s, $\forall k \in \mathcal{I}_i, \forall i$ using the water-level $1/\mu_{\mathrm{ma}}(\mathbf{D})$,
then the sum-rate of the transmission from the relay to the two source nodes using the resulting power allocation
achieves precisely $R^{\mathrm{ma}}(\mathbf{D})$. For brevity, $\mu_1(\mathbf{D}_1)$, $\mu_2(\mathbf{D}_2)$, and $\mu_{\mathrm{ma}}(\mathbf{D})$ are denoted hereafter as $\mu_1$, $\mu_2$ and $\mu_{\mathrm{ma}}$,
respectively. The same markers/superscripts on $\mathbf{D}_i$ and/or $\mathbf{D}$ are used on $\mu_i$ and/or $\mu_{\mathrm{ma}}$ to represent the connection.
For example, $\mu_i(\mathbf{D}_i^0)$ and $\mu_{\mathrm{ma}}(\tilde{\mathbf{D}})$ are briefly denoted as $\mu_i^0$ and $\tilde{\mu}_{\mathrm{ma}}$, respectively.
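Because the waterfilling rate is continuous and non-decreasing in the water-level, a relative water-level can be found numerically by bisection. The sketch below is one possible way to do this and is not taken from the paper; the function names, the natural logarithm, and the fixed iteration count are assumptions.

```python
import math


def rate_at_level(alphas, level):
    """Waterfilling rate sum_k log(1 + (alpha_k * level - 1)^+) at a water-level."""
    return sum(math.log(1.0 + max(a * level - 1.0, 0.0)) for a in alphas)


def relative_water_level(alphas, target_rate, iters=200):
    """Bisection for the level 1/mu at which the waterfilling rate hits target_rate."""
    lo = 1.0 / max(alphas)          # at or below this level the rate is zero
    if target_rate <= 0.0:
        return lo                   # the level is not unique for a zero target
    hi = 2.0 * lo
    while rate_at_level(alphas, hi) < target_rate:
        hi *= 2.0                   # grow the bracket until the target is covered
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if rate_at_level(alphas, mid) < target_rate:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


# 1/mu_1(D_1): waterfill over the gains toward source node 2 until the rate
# equals R_1r(D_1); here the target log(16) corresponds to a level of 2.
alpha_2 = [4.0, 1.0]
print(relative_water_level(alpha_2, math.log(16.0)))
```

For $1/\mu_{\mathrm{ma}}(\mathbf{D})$ the same routine would be called with the gains of both directions concatenated and with $R^{\mathrm{ma}}(\mathbf{D})$ as the target.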

For the network optimization scenario considered here, the relay and the source nodes jointly maximize the
sum-rate in (6) with minimum total transmission power in the network.³ Similar to the relay optimization scenario,
the relay needs to know $\mathbf{W}_1$ and $\mathbf{W}_2$ while both source nodes need to know $\mathbf{T}_{r1}$ and $\mathbf{T}_{r2}$. It is preferable that the
TWR is able to operate in a centralized mode in which the relay can serve as a central node that carries out the
computations. If the system works in a decentralized mode, it may lead to high overhead because of the information
exchange during the iterative optimization process.

²Details on the waterfilling based solution of power allocation can be found, for example, in Section III.A in [14].

³The term 'sum-rate' by default means $R^{\mathrm{tw}}(\mathbf{B}, \mathbf{D})$ when we do not specify it to be the sum-rate of the BC or MA phase.

Given the above system model, we next solve the network optimization problem. 

III. Network Optimization

In the network optimization scenario, the relay and the source nodes jointly optimize their power allocation to 
achieve sum-rate maximization with minimum total power consumption in the system for the MIMO DF TWR. 
Compared to the optimal solution of the relay optimization problem in Part I, the optimal solution of the network 
optimization problem achieves larger sum-rate and/or less power consumption at the cost of higher computational 
complexity. 

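The sum-rate maximization part can be formulated as the following optimization problem⁴

\begin{align}
\max_{\{\mathbf{B}, \mathbf{D}\}} \quad & R^{\mathrm{tw}}(\mathbf{B}, \mathbf{D}) \tag{10a} \\
\text{s.t.} \quad & \mathrm{Tr}\{\mathbf{D}_i\} \le P_i^{\max}, \ \forall i \tag{10b} \\
& \mathrm{Tr}\{\mathbf{B}_1 + \mathbf{B}_2\} \le P_r^{\max} \tag{10c}
\end{align}

where $P_i^{\max}$ and $P_r^{\max}$ are the power limits for source node $i$ and the relay, respectively. The above problem is a
convex problem which can be rewritten into the standard form by introducing variables $t, t_1, t_2$ as follows

\begin{align}
\max_{\{t, t_1, t_2, \mathbf{B}, \mathbf{D}\}} \quad & t \tag{11a} \\
\text{s.t.} \quad & t \le R^{\mathrm{ma}}(\mathbf{D}), \ t \le t_1 + t_2 \tag{11b} \\
& t_i \le R_{ri}(\mathbf{B}_i), \ t_i \le R_{jr}(\mathbf{D}_j), \ \forall i \tag{11c} \\
& \mathrm{Tr}\{\mathbf{D}_i\} \le P_i^{\max}, \ \forall i, \ \mathrm{Tr}\{\mathbf{B}_1 + \mathbf{B}_2\} \le P_r^{\max}. \tag{11d}
\end{align}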

If transmission power minimization is also taken into account, the following constraints are necessary

$$R_{ri}(\mathbf{B}_i) \le R_{jr}(\mathbf{D}_j), \ \forall i \tag{12a}$$

$$R^{\mathrm{ma}}(\mathbf{D}) = R(\mathbf{B}, \mathbf{D}). \tag{12b}$$

The reason why the above constraints are necessary if transmission power minimization also needs to be taken into
account is as follows. Given the fact that $R^{\mathrm{ma}}(\mathbf{D}) < R_{1r}(\mathbf{D}_1) + R_{2r}(\mathbf{D}_2)$ whenever $\mathrm{Tr}\{\mathbf{D}_1\} + \mathrm{Tr}\{\mathbf{D}_2\} > 0$, it can be
shown that the power consumption of the relay can be reduced by reducing $\mathrm{Tr}\{\mathbf{B}_i\}$ without decreasing the sum-rate
$R^{\mathrm{tw}}(\mathbf{B}, \mathbf{D})$ in (6) if $R_{ri}(\mathbf{B}_i) > R_{jr}(\mathbf{D}_j)$. Therefore, the constraint (12a) is necessary. Subject to (12a), $R^{\mathrm{tw}}(\mathbf{B}, \mathbf{D})$
in (6) can be written as $\min\{R^{\mathrm{ma}}(\mathbf{D}), R_{r1}(\mathbf{B}_1) + R_{r2}(\mathbf{B}_2)\}/2$. Using the fact that $R^{\mathrm{ma}}(\mathbf{D}) < R_{r1}(\mathbf{B}_1) + R_{r2}(\mathbf{B}_2)$
when $R_{r1}(\mathbf{B}_1) = R_{2r}(\mathbf{D}_2)$ and $R_{r2}(\mathbf{B}_2) = R_{1r}(\mathbf{D}_1)$, it can be shown that the power consumption of at least one
source node can be reduced without decreasing $R^{\mathrm{tw}}(\mathbf{B}, \mathbf{D})$ if $R^{\mathrm{ma}}(\mathbf{D}) > R(\mathbf{B}, \mathbf{D})$ while the power consumption
of the relay can be reduced without decreasing $R^{\mathrm{tw}}(\mathbf{B}, \mathbf{D})$ if $R^{\mathrm{ma}}(\mathbf{D}) < R(\mathbf{B}, \mathbf{D})$. Thus, the constraint (12b) is
also necessary.

⁴The positive semi-definite constraints $\mathbf{D}_i \succeq 0, \forall i$ and $\mathbf{B}_i \succeq 0, \forall i$ are assumed as default and omitted for brevity in all formulations of
optimization problems in this paper.

Considering the constraints (12a) and (12b), the problem of finding the optimal power allocation already becomes
nonconvex. Relating (9a)-(9c) with (8a)-(8b), the above two constraints (12a) and (12b) can be rewritten as

$$\lambda_i \ge \mu_j, \ \forall i \tag{13a}$$

$$\sum_i R_{ri}(\lambda_i) = \sum_i R_{ri}(\mu_{\mathrm{ma}}). \tag{13b}$$

It should be noted that the constraints (12a) and (12b), or equivalently (13a) and (13b), are not sufficient in general.
Due to the intrinsic complexity of the considered problem, it is too complicated to formulate the general sufficient 
and necessary condition for optimality for the original problem of sum-rate maximization with minimum power 
consumption. Instead, we will show the sufficient and necessary optimality condition for the equivalent problems in 
the subcases in which the original problem can be transferred into equivalent convex problems. For other subcases, 
we will develop important properties based on the above necessary conditions which can significantly reduce the 
computational complexity of searching for the optimal solution. 

In the scenario of network optimization, the three nodes aim at finding the optimal matrices $\mathbf{D}$ and $\mathbf{B}$ that
minimize $\mathrm{Tr}\{\mathbf{D}_1\} + \mathrm{Tr}\{\mathbf{D}_2\} + \mathrm{Tr}\{\mathbf{B}_1 + \mathbf{B}_2\}$ among all $\mathbf{D}$ and $\mathbf{B}$ that achieve the maximum of the objective
function in (10). Considering the fact that the optimal $\mathbf{B}$ and $\mathbf{D}$ depend on each other, solving the considered
problem generally involves alternating optimization of $\mathbf{B}$ and $\mathbf{D}$. It is of interest to avoid such an alternating process,
when it is possible, due to its high complexity. Next we use an initial power allocation⁵ to classify the problem of
finding the optimal $\mathbf{B}$ and $\mathbf{D}$ for network optimization into two cases, each with several subcases.

Consider the following initial power allocation of the source nodes and the relay, which decides the maximum
achievable sum-rates of the MA and BC phases, respectively. The source nodes solve the following problem

\begin{align}
\max_{\mathbf{D}} \quad & R^{\mathrm{ma}}(\mathbf{D}) \tag{14a} \\
\text{s.t.} \quad & \mathrm{Tr}\{\mathbf{D}_i\} \le P_i^{\max}, \ \forall i. \tag{14b}
\end{align}

It is worth mentioning that the problem (14) is a basic power allocation problem on multiple-access channels
studied in [15]. Denote the optimal solution of the above problem as $\mathbf{D}^0 = [\mathbf{D}_1^0, \mathbf{D}_2^0]$. The relay allocates $P_r^{\max}$
on $\alpha_i(k)$'s, $\forall k \in \mathcal{I}_i, \forall i$ based on the waterfilling procedure. Denote the initial water level as $1/\lambda^0$. The case when
$R^{\mathrm{ma}}(\mathbf{D}^0) \ge R_{r1}(\lambda^0) + R_{r2}(\lambda^0)$, i.e., when the maximum achievable sum-rate of the MA phase is larger than or
equal to that of the BC phase, is denoted as Case I and the case when $R^{\mathrm{ma}}(\mathbf{D}^0) < R_{r1}(\lambda^0) + R_{r2}(\lambda^0)$, i.e., when
the maximum achievable sum-rate of the MA phase is less than that of the BC phase, is denoted as Case II. The
joint optimization over $\mathbf{B}$ and $\mathbf{D}$ will be studied in each of these cases. The following lemma that applies to both
cases is introduced for subsequent analysis.

⁵Note that the initial power allocation is not the solution to the considered problem and it is only used for enabling classification.
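The classification into Case I and Case II can be sketched as follows, assuming that the maximum MA sum-rate $R^{\mathrm{ma}}(\mathbf{D}^0)$ has already been obtained from problem (14) by a separate convex solver and is passed in as a scalar. The function names and the numerical values are hypothetical, the gains are plain Python lists, and the natural logarithm is assumed.

```python
import math


def water_level_for_power(alphas, power):
    """Water-level 1/lambda that spends exactly `power` over the gains `alphas`."""
    inv = sorted(1.0 / a for a in alphas)       # floors 1/alpha, lowest first
    for n in range(len(inv), 0, -1):            # try the n strongest eigenmodes
        level = (power + sum(inv[:n])) / n
        if level > inv[n - 1]:                  # all n modes get positive power
            return level
    raise ValueError("power must be positive")


def rate_at_level(alphas, level):
    """Waterfilling rate sum_k log(1 + (alpha_k * level - 1)^+)."""
    return sum(math.log(1.0 + max(a * level - 1.0, 0.0)) for a in alphas)


def classify_case(r_ma0, alphas_1, alphas_2, p_r_max):
    """Return 'I' if R^ma(D^0) >= R_r1(lambda^0) + R_r2(lambda^0), else 'II'."""
    # The relay waterfills its full power jointly over both directions.
    level0 = water_level_for_power(alphas_1 + alphas_2, p_r_max)   # 1/lambda^0
    r_bc0 = rate_at_level(alphas_1, level0) + rate_at_level(alphas_2, level0)
    return "I" if r_ma0 >= r_bc0 else "II"


print(classify_case(3.0, [4.0], [1.0], 2.75))   # MA phase is not the bottleneck
print(classify_case(2.0, [4.0], [1.0], 2.75))   # MA phase is the bottleneck
```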

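Lemma 1: Given $\mathbf{D}_1$ and $\mathbf{D}_2$ with $P_1^{\max} \ge \mathrm{Tr}\{\mathbf{D}_1\} > 0$ and $P_2^{\max} \ge \mathrm{Tr}\{\mathbf{D}_2\} > 0$, if $1/\mu_i > 1/\mu_{\mathrm{ma}} > 1/\mu_j$,
then the following two results hold true: 1) $1/\mu_{\mathrm{ma}}(\tilde{\mathbf{D}}) < 1/\tilde{\mu}_j$ where $\tilde{\mathbf{D}} = [\tilde{\mathbf{D}}_1, \tilde{\mathbf{D}}_2]$ with $\tilde{\mathbf{D}}_i = \mathbf{0}$ and $\tilde{\mathbf{D}}_j = \mathbf{D}_j$;
2) there exists $t \in [0, 1)$ such that with $\hat{\mathbf{D}}_i = t \mathbf{D}_i$ and $\hat{\mathbf{D}}_j = \mathbf{D}_j$, we have $1/\mu_i(\hat{\mathbf{D}}_i) > 1/\mu_{\mathrm{ma}}(\hat{\mathbf{D}}) = 1/\hat{\mu}_j$ where
$\hat{\mathbf{D}} = [\hat{\mathbf{D}}_1, \hat{\mathbf{D}}_2]$.

Proof: See Subsection VI-A in Appendix. ■

Lemma 1 relates the source nodes' transmit strategy $\mathbf{D}$ with the relative water-levels $1/\mu_1$, $1/\mu_2$, and $1/\mu_{\mathrm{ma}}$. It
shows a range in which the relative water-level $1/\mu_{\mathrm{ma}}$ can change when fixing $\mathbf{D}_j$ and changing $\mathbf{D}_i$ given that
$1/\mu_i > 1/\mu_{\mathrm{ma}} > 1/\mu_j$.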

Lemma 2: The optimal solution of the network optimization problem has the following property

$$\lambda_j = \mu_i > \mu_{\mathrm{ma}} \quad \text{if } \lambda_i < \lambda_j \text{ or } \mu_i > \mu_{\mathrm{ma}}. \tag{15}$$

Proof: See Subsection VI-B in Appendix. ■

Lemma 2 develops a property of the optimal solution that follows from the constraints (13a) and (13b). This
property is needed for future analysis.

We next study the problem of maximizing $R^{\mathrm{tw}}(\mathbf{B}, \mathbf{D})$ with minimum power consumption and find the optimal
power allocation for Cases I and II, respectively, in the following subsections.

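A. Finding the optimal solution in Case I, i.e., $R^{\mathrm{ma}}(\mathbf{D}^0) \ge R_{r1}(\lambda^0) + R_{r2}(\lambda^0)$

Since $R^{\mathrm{ma}}(\mathbf{D}^0) \ge R_{r1}(\lambda^0) + R_{r2}(\lambda^0)$, it can be shown that $1/\lambda^0 \le 1/\mu_{\mathrm{ma}}^0$. In this case, the sum-rate $R^{\mathrm{tw}}(\mathbf{B}, \mathbf{D})$
in (6) is upper-bounded by the sum-rate $R_{r1}(\lambda^0) + R_{r2}(\lambda^0)$. The following two subcases should be considered
separately.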

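Subcase I-1: The following convex optimization problem is feasible

\begin{align}
\min_{\mathbf{D}} \quad & \mathrm{Tr}\{\mathbf{D}_1\} + \mathrm{Tr}\{\mathbf{D}_2\} \tag{16a} \\
\text{s.t.} \quad & R^{\mathrm{ma}}(\mathbf{D}) \ge R_{r1}(\lambda^0) + R_{r2}(\lambda^0) \tag{16b} \\
& R_{1r}(\mathbf{D}_1) \ge R_{r2}(\lambda^0) \tag{16c} \\
& R_{2r}(\mathbf{D}_2) \ge R_{r1}(\lambda^0) \tag{16d} \\
& \mathrm{Tr}(\mathbf{D}_i) \le P_i^{\max}, \ \forall i. \tag{16e}
\end{align}

In this subcase, the maximum sum-rate $R^{\mathrm{tw}}(\mathbf{B}, \mathbf{D})$ can achieve $R_{r1}(\lambda^0) + R_{r2}(\lambda^0)$. In order to achieve this maximum
sum-rate, it is necessary that $\lambda_1 = \lambda_2 = \lambda^0$. Therefore, the relay should use up all available power $P_r^{\max}$ at
optimality, and the optimal $\mathbf{B}_i, \forall i$ is equal to $\mathbf{V}_{ri} \mathbf{P}_{ri}(\lambda^0) \mathbf{V}_{ri}^H$ where $\mathbf{P}_{ri}(\lambda_i)$ is given in Section II. As a result, the
original problem simplifies to finding the optimal $\mathbf{D}_1$ and $\mathbf{D}_2$ such that $R^{\mathrm{tw}}(\mathbf{B}, \mathbf{D})$ achieves $R_{r1}(\lambda^0) + R_{r2}(\lambda^0)$
with minimum power consumption. Using equations (6) and (7), it can be shown that the sufficient and necessary
condition for $\mathbf{D}$ to be optimal in this subcase is that $\mathbf{D}$ is the optimal solution to the convex optimization problem
(16). Denoting the optimal solution to the problem (16) as $\mathbf{D}^* = [\mathbf{D}_1^*, \mathbf{D}_2^*]$, the total power consumption in this
subcase is $P_r^{\max} + \mathrm{Tr}\{\mathbf{D}_1^*\} + \mathrm{Tr}\{\mathbf{D}_2^*\}$.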

It can be seen that the optimal solution of $\mathbf{B}$ and $\mathbf{D}$ described above satisfies the necessary condition (13a) as
the constraints (16c) and (16d) are considered in the problem (16). It can also be shown that the above optimal
solution in Subcase I-1 satisfies the necessary condition (13b), as stated in the following theorem.

Theorem 1: The optimal solution in Subcase I-1 satisfies $\mu_{\mathrm{ma}}^* = \lambda^0$ where $\mu_{\mathrm{ma}}^* = \mu_{\mathrm{ma}}(\mathbf{D}^*)$, and thereby satisfies
(13b) given that $\lambda_1 = \lambda_2 = \lambda^0$ at optimality.

Proof: See Subsection VI-C in Appendix.

Considering the constraints (16b)-(16e), it can be seen that the problem (16) is feasible if and only if the optimal
solution to the following problem

\begin{align}
\max_{\mathbf{D}} \quad & R_{jr}(\mathbf{D}_j) \tag{17a} \\
\text{s.t.} \quad & R_{ir}(\mathbf{D}_i) \ge R_{rj}(\lambda^0) \tag{17b} \\
& R^{\mathrm{ma}}(\mathbf{D}) \ge R_{r1}(\lambda^0) + R_{r2}(\lambda^0) \tag{17c} \\
& \mathrm{Tr}(\mathbf{D}_i) \le P_i^{\max}, \ \forall i \tag{17d}
\end{align}

denoted as $\mathbf{D}^*$, satisfies $R_{jr}(\mathbf{D}_j^*) \ge R_{ri}(\lambda^0)$.⁶ However, it is possible that $R_{jr}(\mathbf{D}_j^*) < R_{ri}(\lambda^0)$. It is also possible
that the problem (17) is not even feasible. In both of the above two situations the problem (16) is infeasible. This
leads to the second subcase of Case I.

Subcase I-2: The problem (16) is infeasible.

Unlike Subcase I-1, the maximum sum-rate $R^{\mathrm{tw}}(\mathbf{B}, \mathbf{D})$ in this subcase cannot achieve $R_{r1}(\lambda^0) + R_{r2}(\lambda^0)$. As
mentioned above, there are two possible situations when the problem (16) is infeasible: (i) $R_{jr}(\mathbf{D}_j^*) < R_{ri}(\lambda^0)$,
and (ii) the problem (17) is infeasible. Using Lemma 1 in Part I of this two-part paper [2] and the fact that
$R^{\mathrm{ma}}(\mathbf{D}^0) \ge R_{r1}(\lambda^0) + R_{r2}(\lambda^0)$ for Case I, it can be shown that if the problem (17) is infeasible for specific values of
$i$ and $j$, then it is feasible (but $R_{jr}(\mathbf{D}_j^*) < R_{ri}(\lambda^0)$) when the values of $i$ and $j$ are switched. Therefore, the problem
(16) is infeasible if and only if there exists at least one specific value of $j$ in $\{1, 2\}$ such that $R_{jr}(\mathbf{D}_j^*) < R_{ri}(\lambda^0)$
in the problem (17). It infers, based on the definitions (9a)-(9c), that $1/\mu_j < 1/\lambda^0$ whenever $1/\mu_{\mathrm{ma}} \ge 1/\lambda^0$ and
$1/\mu_i \ge 1/\lambda^0$. As a result, whenever $1/\mu_{\mathrm{ma}} \ge 1/\lambda^0$, or equivalently, $R^{\mathrm{ma}}(\mathbf{D}) \ge R_{r1}(\lambda^0) + R_{r2}(\lambda^0)$, the sum-rate
$R^{\mathrm{tw}}(\mathbf{B}, \mathbf{D})$ is bounded by $R_{r1}(\lambda_1) + R_{r2}(\lambda_2)$ (according to equation (6)), which is less than $R_{r1}(\lambda^0) + R_{r2}(\lambda^0)$
when $1/\mu_j < 1/\lambda^0$ (according to the constraint (13a)). Moreover, whenever $1/\mu_{\mathrm{ma}} < 1/\lambda^0$, or equivalently,
$R^{\mathrm{ma}}(\mathbf{D}) < R_{r1}(\lambda^0) + R_{r2}(\lambda^0)$, the sum-rate $R^{\mathrm{tw}}(\mathbf{B}, \mathbf{D})$ is bounded by $R^{\mathrm{ma}}(\mathbf{D})$ (according to equation (6)), which
is also less than $R_{r1}(\lambda^0) + R_{r2}(\lambda^0)$. Therefore, the maximum sum-rate $R^{\mathrm{tw}}(\mathbf{B}, \mathbf{D})$ in this subcase cannot achieve
$R_{r1}(\lambda^0) + R_{r2}(\lambda^0)$.

⁶Note that if $R_{jr}(\mathbf{D}_j^*) \ge R_{ri}(\lambda^0)$ for $i = 1, j = 2$ in (17), then it also holds that $R_{jr}(\mathbf{D}_j^*) \ge R_{ri}(\lambda^0)$ for $i = 2, j = 1$, and vice versa.

We specify $j$ for this subcase so that the problem (17) is feasible but $R_{jr}(\mathbf{D}_j^*) < R_{ri}(\lambda^0)$. The following theorem
characterizes the optimal solution in this subcase.

Theorem 2: Denote the optimal $\mathbf{D}_l$ in Subcase I-2 as $\mathbf{D}_l^*$, $l \in \{1, 2\}$ and the optimal $\lambda_l$ as $\lambda_l^*$, $\forall l$. The optimal
strategies for the source nodes and the relay satisfy the following properties:

1. $\min_l \{1/\mu_l^*\} < 1/\mu_{\mathrm{ma}}^* < 1/\lambda^0$;

2. The relay uses full power $P_r^{\max}$;

3. $\mathbf{D}^*$ maximizes $\min_l \{1/\mu_l\}$ among all $\mathbf{D}$'s that satisfy

\begin{align}
& R^{\mathrm{ma}}(\mathbf{D}) \ge R^{\mathrm{ma}}(\mathbf{D}^*) \tag{18a} \\
& \mathrm{Tr}(\mathbf{D}_l) \le P_l^{\max}, \ \forall l \tag{18b}
\end{align}

4. $1/\mu_j^* < 1/\mu_i^*$.

Proof: Please see Subsection VI-D in Appendix.

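While the original problem cannot be simplified into an equivalent form in this subcase, the properties in the above
theorem help to significantly reduce the complexity of searching for the optimal solution by narrowing down the set
of qualifying power allocations. Denote the $\mathbf{D}_j$ that maximizes $R_{jr}(\mathbf{D}_j)$ subject to the constraints $\mu_j \ge \mu_{\mathrm{ma}}$ and
$\mathrm{Tr}\{\mathbf{D}_j\} \le P_j^{\max}$ as $\bar{\mathbf{D}}_j$ and the corresponding $\mu_j$ as $\bar{\mu}_j$. According to Theorem 2, if $R_{ri}(\lambda_i) + R_{rj}(\lambda_j) \le R^{\mathrm{ma}}(\mathbf{D})$,
where

\begin{align}
& \lambda_i = \bar{\mu}_j \tag{19a} \\
& \mathrm{Tr}\{\mathbf{P}_{ri}(\lambda_i)\} + \mathrm{Tr}\{\mathbf{P}_{rj}(\lambda_j)\} = P_r^{\max} \tag{19b}
\end{align}

and $\mathbf{D}$ is the optimal solution of the following problem

\begin{align}
\max_{\mathbf{D}} \quad & R^{\mathrm{ma}}(\mathbf{D}) \tag{20a} \\
\text{s.t.} \quad & R_{jr}(\mathbf{D}_j) \ge R_{jr}(\bar{\mathbf{D}}_j) \tag{20b} \\
& \mathrm{Tr}(\mathbf{D}_l) \le P_l^{\max}, \ \forall l \tag{20c}
\end{align}

then the optimal $\mathbf{B}_l, \forall l$ in Subcase I-2 is given by $\mathbf{B}_l = \mathbf{V}_{rl} \mathbf{P}_{rl}(\lambda_l) \mathbf{V}_{rl}^H$ and the optimal $\mathbf{D}$ is the solution to the
following power minimization problem

\begin{align}
\min_{\mathbf{D}} \quad & \mathrm{Tr}\{\mathbf{D}_1\} + \mathrm{Tr}\{\mathbf{D}_2\} \tag{21a} \\
\text{s.t.} \quad & R^{\mathrm{ma}}(\mathbf{D}) \ge \sum_l R_{rl}(\lambda_l) \tag{21b} \\
& R_{ir}(\mathbf{D}_i) \ge R_{rj}(\lambda_j) \tag{21c} \\
& R_{jr}(\mathbf{D}_j) \ge R_{ri}(\lambda_i) \tag{21d} \\
& \mathrm{Tr}(\mathbf{D}_l) \le P_l^{\max}, \ \forall l. \tag{21e}
\end{align}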

December 18, 2012 



DRAFT 






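If $R_{ri}(\lambda_i) + R_{rj}(\lambda_j) > R^{\mathrm{ma}}(\mathbf{D})$, then according to Theorem 2, the optimal solution can be found by maximizing
the objective $R^{\mathrm{tw}}(\mathbf{B}, \mathbf{D})$, denoted as $R^{\mathrm{obj}}$, that can be achieved by both $R^{\mathrm{ma}}(\mathbf{D})$ and $\sum_l R_{rl}(\lambda_l)$ subject to the
following two constraints: 1) $1/\lambda_i = 1/\mu_j$ (according to Lemma 2, Properties 1 and 4 of Theorem 2); 2) $1/\lambda_j$
is obtained by waterfilling the remaining power on $\omega_{rj}(k), \forall k \in \mathcal{I}_j$ (Property 2 of Theorem 2), where $1/\mu_j$ is the
optimal value of the objective function in the following optimization problem (Property 3 of Theorem 2)

\begin{align}
\max_{\mathbf{D}} \quad & \frac{1}{\mu_j} \tag{22a} \\
\text{s.t.} \quad & R^{\mathrm{ma}}(\mathbf{D}) \ge R^{\mathrm{obj}} \tag{22b} \\
& \mathrm{Tr}(\mathbf{D}_l) \le P_l^{\max}, \ \forall l. \tag{22c}
\end{align}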

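Since maximizing $1/\mu_j$ is equivalent to maximizing $R_{jr}(\mathbf{D}_j)$, the objective function of the above problem can be
substituted by $R_{jr}(\mathbf{D}_j)$, and $1/\mu_j$ can be obtained from the optimal value of $R_{jr}(\mathbf{D}_j)$ in the above problem using
(9a) or (9b). As mentioned at the beginning of Subcase I-2, the optimal $R^{\mathrm{tw}}(\mathbf{B}, \mathbf{D})$ is less than $\sum_l R_{rl}(\lambda^0)$. Therefore,
starting from the point by setting $R^{\mathrm{obj}} = \sum_l R_{rl}(\lambda^0)$, we can adjust $R^{\mathrm{obj}}$ to achieve the optimal $R^{\mathrm{tw}}(\mathbf{B}, \mathbf{D})$ by
solving the following problem

\begin{align}
\max_{\mathbf{D}} \quad & R_{jr}(\mathbf{D}_j) \tag{23a} \\
\text{s.t.} \quad & R^{\mathrm{ma}}(\mathbf{D}) \ge R^{\mathrm{obj}} \tag{23b} \\
& \mathrm{Tr}(\mathbf{D}_l) \le P_l^{\max}, \ \forall l \tag{23c}
\end{align}

and obtain the resulting $\mu_j$ from the above problem. Setting $1/\lambda_i = 1/\mu_j$ and allocating all the remaining power
on $\omega_{rj}(k)$'s, $\forall k \in \mathcal{I}_j$, if the resulting $\sum_l R_{rl}(\lambda_l)$ is less than $R^{\mathrm{obj}}$, then $R^{\mathrm{obj}}$ should be decreased and the above
process should be repeated. If the resulting $\sum_l R_{rl}(\lambda_l)$ is larger than $R^{\mathrm{obj}}$, then $R^{\mathrm{obj}}$ should be increased and the
above process should be repeated. The optimal solution is found when the resulting $\sum_l R_{rl}(\lambda_l)$ is equal to $R^{\mathrm{ma}}(\mathbf{D})$.
With an appropriate step size of increasing/decreasing $R^{\mathrm{obj}}$, $R^{\mathrm{obj}}$ converges to the optimal $R^{\mathrm{tw}}(\mathbf{B}, \mathbf{D})$ in the above
procedure.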

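After obtaining the optimal $R^{\mathrm{obj}}$, $1/\mu_j$ and $\lambda_l$, the source nodes need to solve the problem of power minimization,
which is

\begin{align}
\min_{\mathbf{D}} \quad & \mathrm{Tr}\{\mathbf{D}_1\} + \mathrm{Tr}\{\mathbf{D}_2\} \tag{24a} \\
\text{s.t.} \quad & R^{\mathrm{ma}}(\mathbf{D}) \ge R^{\mathrm{obj}} \tag{24b} \\
& R_{ir}(\mathbf{D}_i) \ge R_{rj}(\lambda_j) \tag{24c} \\
& R_{jr}(\mathbf{D}_j) \ge R_{ri}(\lambda_i) \tag{24d} \\
& \mathrm{Tr}(\mathbf{D}_l) \le P_l^{\max}, \ \forall l. \tag{24e}
\end{align}

However, it can be shown that if $R_{jr}(\mathbf{D}_j)$ is not the maximum that $R_{jr}(\mathbf{D}_j)$ can achieve subject to the constraint
(22c) (without the constraint (22b)), then $\mathbf{B}$ and $\mathbf{D}$ remain the same after solving the above problem.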
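TABLE I: Algorithm for finding the optimal solution for Case I

1. Check if the problem (16) is feasible. If yes, find the optimal $\mathbf{D}$ from the problem (16). The optimal $\mathbf{B}$ is given by
$\mathbf{B}_i = \mathbf{V}_{ri} \mathbf{P}_{ri}(\lambda^0) \mathbf{V}_{ri}^H, \forall i$. Otherwise, specify $j$ so that the problem (17) is feasible but $R_{jr}(\mathbf{D}_j^*) < R_{ri}(\lambda^0)$ and
proceed to Step 2.

2. Obtain $\bar{\mathbf{D}}_j$ and $\bar{\mu}_j$. Calculate $\lambda_l, \forall l$ using (19). Check if $\sum_l R_{rl}(\lambda_l) \le R^{\mathrm{ma}}(\mathbf{D})$. If yes, the optimal $\mathbf{B}$ is given by
$\mathbf{B}_l = \mathbf{V}_{rl} \mathbf{P}_{rl}(\lambda_l) \mathbf{V}_{rl}^H, \forall l$. Find the optimal $\mathbf{D}$ from (21). Otherwise, proceed to Step 3.

3. Set $R^{\max} = \sum_l R_{rl}(\lambda^0)$ and $R^{\min} = 0$. Initialize $R^{\mathrm{obj}} = R^{\max}$ and proceed to Step 4.

4. Solve the problem (23) and obtain $\mathbf{D}$ and $1/\mu_j$. Set $1/\lambda_i = 1/\mu_j$. Allocate all the remaining power on $\omega_{rj}(k)$'s,
$\forall k \in \mathcal{I}_j$ using waterfilling and obtain $\lambda_j$. Check if $|\sum_l R_{rl}(\lambda_l) - R^{\mathrm{ma}}(\mathbf{D})| < \epsilon$, where $\epsilon$ is the positive tolerance.
If yes, proceed to Step 6 with $R^{\mathrm{obj}}$ and $\lambda_l, \forall l$. Otherwise, proceed to Step 5.

5. If $R^{\mathrm{ma}}(\mathbf{D}) - \sum_l R_{rl}(\lambda_l) > \epsilon$, set $R^{\max} = R^{\mathrm{obj}}$. If $\sum_l R_{rl}(\lambda_l) - R^{\mathrm{ma}}(\mathbf{D}) > \epsilon$, set $R^{\min} = R^{\mathrm{obj}}$.
Let $R^{\mathrm{obj}} = (R^{\max} + R^{\min})/2$ and go back to Step 4.

6. Solve the power minimization problem (24). Output $\mathbf{D}$ and $\mathbf{B}_l = \mathbf{V}_{rl} \mathbf{P}_{rl}(\lambda_l) \mathbf{V}_{rl}^H, \forall l$.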



Using Property 2 of Theorem 2, it can be seen from (l2ll and (|24) that the minimization of total power consumption 
becomes the minimization of the source node power consumption in Subcase 1-2 since the relay always needs to 
consume all its available power for achieving optimality. 

The complete procedure of finding the optimal solution in Case I is summarized in the algorithm in Table I. The 
algorithm finds the optimal solution either in one shot (Steps 1 and 2) or through a bisection search for the optimal 
$R^{obj}$ (Steps 3 to 5). Denoting $\Delta = R^{max} - R^{min}$, the worst case number of iterations in the bisection search is 
$\log(\Delta/\epsilon)$. Within each iteration, a convex problem, i.e., problem (23), is solved followed by a simple waterfilling 
procedure (linear complexity) for the given $R^{obj}$. Therefore, the complexity of the proposed algorithm is low. 
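The bisection search in Steps 3 to 5 of Table I can be sketched as follows. This is only a minimal illustration under stated assumptions, not the authors' implementation: the callables `ma_rate` and `bc_rate` are hypothetical stand-ins for solving problem (23) and for the waterfilling step, and the two toy rate functions are invented purely to make the sketch runnable.

```python
def bisect_r_obj(r_max, ma_rate, bc_rate, eps=1e-6, max_iter=200):
    """Bisection over the target rate R^obj, mirroring Steps 3 to 5 of Table I.

    ma_rate(r_obj): hypothetical stand-in for solving problem (23);
                    returns the MA sum-rate R^ma(D) for the given target.
    bc_rate(r_obj): hypothetical stand-in for the waterfilling step;
                    returns the BC sum-rate sum_l R_rl(lambda_l).
    """
    r_lo, r_hi = 0.0, r_max            # Step 3: R^min = 0, R^max given
    r_obj = r_hi                       # Step 3: initialize R^obj = R^max
    for iteration in range(1, max_iter + 1):
        gap = ma_rate(r_obj) - bc_rate(r_obj)   # Step 4: compare the two rates
        if abs(gap) < eps:
            return r_obj, iteration    # rates are balanced within tolerance
        if gap > 0:
            r_hi = r_obj               # Step 5: MA rate too large, lower R^max
        else:
            r_lo = r_obj               # Step 5: BC rate too large, raise R^min
        r_obj = 0.5 * (r_lo + r_hi)
    raise RuntimeError("bisection did not converge")


# Toy rate models (assumptions for illustration only): the MA phase meets
# the target exactly, and the BC sum-rate shrinks as the target grows.
def toy_ma_rate(r_obj):
    return r_obj


def toy_bc_rate(r_obj):
    return 3.0 - 0.5 * r_obj


r_star, n_iter = bisect_r_obj(3.0, toy_ma_rate, toy_bc_rate)
```

In this toy model the two rates balance at a target of 2, and the search interval is halved in every iteration, which is the behaviour behind the logarithmic iteration count stated above.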

Subcases I-1 and I-2 cover all possible situations for Case I that $R^{ma}(\mathbf{D}^0) \geq R_{r1}(\lambda^0) + R_{r2}(\lambda^0)$. 

B. Finding the optimal solution in Case II, i.e., $R^{ma}(\mathbf{D}^0) < R_{r1}(\lambda^0) + R_{r2}(\lambda^0)$ 

Since $R^{ma}(\mathbf{D}^0) < R_{r1}(\lambda^0) + R_{r2}(\lambda^0)$, it can be seen from the corresponding rate definitions that $1/\lambda^0 > 1/\mu_{ma}^0$. The following 
four subcases are possible. 

Subcase II-1: $1/\mu_{ma}^0 < \min\{1/\mu_1^0, 1/\mu_2^0\}$. In this subcase, the maximum $R^{tw}(\mathbf{B},\mathbf{D})$ is bounded by $R^{ma}(\mathbf{D}^0)$. 
The optimal $\mathbf{D}$ is $\mathbf{D}^0$, and consequently both source nodes use all their available power at optimality. It can be 
shown that the sufficient and necessary condition for $\mathbf{B}$ to be optimal in this subcase is that $\mathbf{B}$ is the optimal 
solution to the following convex optimization problem 

$\min_{\mathbf{B}} \ \mathrm{Tr}\{\mathbf{B}_1 + \mathbf{B}_2\}$ (25a) 

$\mathrm{s.t.} \ R_{r1}(\mathbf{B}_1) + R_{r2}(\mathbf{B}_2) \geq R^{ma}(\mathbf{D}^0).$ (25b) 

The solution of (25) can be found in closed-form and it is given by $\mathbf{B}_i = \mathbf{V}_{ri}\mathbf{P}_{ri}(\mu_{ma}^0)\mathbf{V}_{ri}^H, \forall i$. 
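The structure of a minimum-power solution under a sum-rate target, as in problem (25), can be illustrated on parallel scalar channels. The sketch below is a simplification under stated assumptions and not the closed-form expression of the paper: it assumes that the allocation is of waterfilling type with one common water level, the `gains` are hypothetical eigen-channel gains normalized by the noise power, rates are measured in bits, and the water level is found numerically by bisection rather than taken from $1/\mu_{ma}^0$.

```python
import numpy as np


def min_power_for_rate(gains, rate_target, tol=1e-12):
    """Least total power meeting a sum-rate target over parallel channels.

    gains: hypothetical normalized channel gains (assumption of this sketch).
    Returns the common water level and the per-channel powers.
    """
    gains = np.asarray(gains, dtype=float)

    def allocate(level):
        # Waterfilling: a channel is used only if the level exceeds 1/gain.
        return np.maximum(level - 1.0 / gains, 0.0)

    def rate(level):
        return float(np.sum(np.log2(1.0 + gains * allocate(level))))

    # At level 2^R / max(gains) the strongest channel alone reaches the
    # target, so the required level lies in [0, that value].
    lo, hi = 0.0, 2.0 ** rate_target / gains.max()
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if rate(mid) < rate_target:
            lo = mid
        else:
            hi = mid
    return hi, allocate(hi)


level, powers = min_power_for_rate([2.0, 1.0, 0.1], rate_target=2.0)
```

For these example gains only the two strongest channels receive power; the weakest channel stays below the water level and is left unused.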

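Subcase II-2: there exist $i$ and $j$ such that $1/\mu_i^0 < 1/\mu_{ma}^0 < 1/\mu_j^0 \leq 1/\lambda^0$. In this subcase, the maximum 
$R^{tw}(\mathbf{B},\mathbf{D})$ is also bounded by $R^{ma}(\mathbf{D}^0)$. Therefore, the optimal $\mathbf{D}$ is $\mathbf{D}^0$ and both source nodes use all their 
available power at optimality. It can be shown that the sufficient and necessary condition for $\mathbf{B}$ to be optimal in 
this subcase is that $\mathbf{B}$ is the optimal solution to the following convex optimization problem 

$\min_{\mathbf{B}} \ \mathrm{Tr}\{\mathbf{B}_1 + \mathbf{B}_2\}$ (26a) 

$\mathrm{s.t.} \ R_{r1}(\mathbf{B}_1) + R_{r2}(\mathbf{B}_2) \geq R^{ma}(\mathbf{D}^0)$ (26b) 
$R_{ri}(\mathbf{B}_i) = R_{jr}(\mathbf{D}_j^0).$ (26c) 

The solution of (26) can also be expressed in closed-form. The optimal $\mathbf{B}_i$ is given by $\mathbf{B}_i = \mathbf{V}_{ri}\mathbf{P}_{ri}(\mu_j^0)\mathbf{V}_{ri}^H$ and 
the optimal $\mathbf{B}_j$ is given by $\mathbf{B}_j = \mathbf{V}_{rj}\mathbf{P}_{rj}(\lambda_j)\mathbf{V}_{rj}^H$, where $\lambda_j$ satisfies $R_{rj}(\lambda_j) = R^{ma}(\mathbf{D}^0) - R_{jr}(\mathbf{D}_j^0)$. 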
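
Subcase II-3: there exist $i$ and $j$ such that $1/\mu_i^0 < 1/\mu_{ma}^0 < 1/\lambda^0 < 1/\mu_j^0$ and there exists $\lambda_j$ such that 

$R_{rj}(\lambda_j) \geq R^{ma}(\mathbf{D}^0) - R_{jr}(\mathbf{D}_j^0)$ (27a) 
$\mathrm{Tr}\{\mathbf{P}_{rj}(\lambda_j)\} \leq P_r^{max} - \mathrm{Tr}\{\mathbf{P}_{ri}(\mu_j^0)\}.$ (27b) 

The optimal solutions of $\mathbf{B}$ and $\mathbf{D}$ in this subcase are the same as those given in Subcase II-2. 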

In the above three subcases, the maximum achievable $R^{tw}(\mathbf{B},\mathbf{D})$ is $R^{ma}(\mathbf{D}^0)$. Therefore, the original problem 
of maximizing $R^{tw}(\mathbf{B},\mathbf{D})$ with minimum total power consumption in the network simplifies to the problem that 
the relay uses minimum power consumption to achieve the BC phase sum-rate $R_{r1}(\mathbf{B}_1) + R_{r2}(\mathbf{B}_2)$ that is equal 
to $R^{ma}(\mathbf{D}^0)$. 

Subcase II-4: there exist $i$ and $j$ such that $1/\mu_i^0 < 1/\mu_{ma}^0 < 1/\lambda^0 < 1/\mu_j^0$ and there is no $\lambda_j$ that satisfies 
the conditions in (27). In this subcase, the maximum $R(\mathbf{B},\mathbf{D})$ cannot achieve $R^{ma}(\mathbf{D}^0)$ although $R^{ma}(\mathbf{D}^0) < 
R_{r1}(\lambda^0) + R_{r2}(\lambda^0)$. 

Theorem 3: Denote the optimal $\mathbf{D}_l$ as $\mathbf{D}_l^*$, and the optimal $\lambda_l$ as $\lambda_l^*$, $\forall l$. In Subcase II-4, the optimal strategies 
for the source nodes and the relay satisfy the following properties: 

1. $\min_l\{1/\mu_l^*\} < 1/\mu_{ma}^* < 1/\mu_{ma}^0$; 

2. Properties 2-4 in Theorem 2 also apply for Subcase II-4. 
Proof: See Subsection VI-E in Appendix. 

According to Theorem 3, the original problem of maximizing $R^{tw}(\mathbf{B},\mathbf{D})$ with minimum total power consumption 
becomes the problem that the source nodes and the relay jointly find the maximum achievable $R^{tw}(\mathbf{B},\mathbf{D})$ with 
the relay using all its available power and the source nodes using minimum power. From Theorem 3, it can be 
TABLE II: Summary of the overall algorithm for network optimization. 

1. Initial power allocation. The source nodes solve the MA sum-rate maximization problem (14) and obtain $\mathbf{D}^0$, $R_{ir}(\mathbf{D}_i^0), \forall i$, and $R^{ma}(\mathbf{D}^0)$. The relay obtains $\lambda^0$ and $R_{ri}(\lambda^0), \forall i$. 

2. Determining the cases. Check if $R^{ma}(\mathbf{D}^0) \geq \sum_i R_{ri}(\lambda^0)$. If yes, proceed to Step 3. Otherwise, proceed to Step 4. 

3. Case I. Determine the subcase based on $\mu_1^0$, $\mu_2^0$, $\mu_{ma}^0$, and $\lambda^0$. For Subcase I-1, the relay's optimal strategy is $\mathbf{B}_i = \mathbf{V}_{ri}\mathbf{P}_{ri}(\lambda^0)\mathbf{V}_{ri}^H$ while the source nodes solve problem (16) for transmission power minimization. For Subcase I-2, use Steps 2 to 6 of the algorithm in Table I for deriving the optimal strategies for both the source nodes and the relay. 

4. Case II. Determine the subcase based on $\mu_1^0$, $\mu_2^0$, $\mu_{ma}^0$, and $\lambda^0$. For Subcases II-1, II-2, and II-3, the optimal strategy for source $i$ is $\mathbf{D}_i^0$ and the relay minimizes its transmission power via solving the problems (25) or (26). For Subcase II-4, substitute $R^{max} = \sum_l R_{rl}(\lambda^0)$ in Step 3 of Table I by $R^{max} = R^{ma}(\mathbf{D}^0)$ and use Steps 2 to 6 of the algorithm in Table I for finding the optimal strategies for both the source nodes and the relay. 



seen that the optimal solutions in Subcases I-2 and II-4 share very similar properties. There is also an intuitive 
way to understand the similarity. Although Subcases I-2 and II-4 are classified to opposite cases according to the 
initial power allocation, it is the same for them that $R(\mathbf{B},\mathbf{D})$ cannot achieve $R^{ma}(\mathbf{D}^0)$. As a result, the relay 
needs to use as much power as possible and the source nodes need to decrease $R^{ma}(\mathbf{D})$ from $R^{ma}(\mathbf{D}^0)$ until 
the maximum $R(\mathbf{B},\mathbf{D})$ can achieve $R^{ma}(\mathbf{D})$. This similarity leads to the common properties of the above two 
subcases. Moreover, due to this similarity between Theorems 2 and 3, Steps 2 to 6 of the algorithm in Table I can 
be used to derive the optimal solution in Subcase II-4 if the part of $R^{max} = \sum_l R_{rl}(\lambda^0)$ in Step 3 is substituted by 
$R^{max} = R^{ma}(\mathbf{D}^0)$. 

Concluding Case I and Case II, the complete procedure of deriving the optimal solution to the problem of sum-rate 
maximization with minimum transmission power for the scenario of network optimization is summarized in 
Table II. 
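The subcase determination for Case II (Step 4 of Table II) can be sketched as a simple decision function. The sketch rests on the subcase conditions as they are stated in Subsection B and is only illustrative: the function name, the argument names, and the flag `lambda_j_exists` (a hypothetical boolean telling whether some $\lambda_j$ satisfies the conditions in (27)) are assumptions, and boundary configurations that the stated conditions do not cover are rejected rather than guessed.

```python
def classify_case_ii(inv_mu1, inv_mu2, inv_mu_ma, inv_lam, lambda_j_exists):
    """Return the Case II subcase label from the initial power allocation.

    Arguments are the values 1/mu_1^0, 1/mu_2^0, 1/mu_ma^0 and 1/lambda^0.
    lambda_j_exists: hypothetical flag, True if some lambda_j meets (27).
    """
    if not inv_mu_ma < inv_lam:
        raise ValueError("Case II requires 1/mu_ma^0 < 1/lambda^0")
    low, high = sorted((inv_mu1, inv_mu2))
    if inv_mu_ma < low:
        return "II-1"                 # 1/mu_ma^0 below both 1/mu_i^0
    if low < inv_mu_ma < high:
        if high <= inv_lam:
            return "II-2"             # larger 1/mu_j^0 does not exceed 1/lambda^0
        return "II-3" if lambda_j_exists else "II-4"
    raise ValueError("boundary configuration not covered by this sketch")


labels = [
    classify_case_ii(2.0, 3.0, 1.0, 5.0, False),
    classify_case_ii(1.0, 3.0, 2.0, 5.0, False),
    classify_case_ii(1.0, 6.0, 2.0, 5.0, True),
    classify_case_ii(1.0, 6.0, 2.0, 5.0, False),
]
```

Only the distinction between Subcases II-3 and II-4 needs the feasibility check of (27); the other two subcases follow from comparing the four scalar values.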

C. Discussion: efficiency and the effect of asymmetry 

In the previous two subsections, we find solutions of the network optimization problem in different subcases. 
Given the solutions found in the previous subsections, these subcases can now be compared and related to each 
other for more insights. 

The solutions found in all subcases are optimal in the sense that they achieve the maximum achievable sum-rate 
with the minimum possible power consumption. However, the optimal solutions in different subcases may not be 
equally good from another viewpoint which is power efficiency at the relay and the source nodes. Specifically, 
although the power allocation of the source nodes and the relay jointly maximizes the sum-rate of the TWR over 
the MA and BC phases at optimality, the power allocation of these nodes may not be optimal in their individual 
phase of transmission, which is the MA phase for the source nodes and the BC phase for the relay. In fact, the power 
allocations in the two phases have to compromise with each other in order to achieve optimality over two phases. It 
is so because of the rate balancing constraints (12a) and (12b). It implies that there is a cost of coordinating the relay 
and source nodes to achieve optimality over two phases. This cost can be very different depending on the specific 
subcase. In order to show the difference in this cost, we use the metric efficiency defined next. A given power 
allocation of the relay (source nodes) is considered as efficient if it maximizes the BC (MA) phase sum-rate with 
the actual power consumption of this power allocation. For example, if the power allocation of the relay consumes 
the power of $P_r < P_r^{max}$ at optimality and achieves sum-rate $R^{bc}$ in the BC phase, then this power allocation is 
efficient if $R^{bc}$ is the maximum achievable sum-rate in the BC phase with power consumption $P_r$. It is inefficient 
otherwise. It can be shown that the chance that the optimal power allocation is efficient for both the relay and the 
source nodes is small (such a situation is guaranteed to happen in Subcase II-1 and it is possible in only one other 
subcase, i.e., Subcase I-1). Therefore, a joint power allocation of the relay and source nodes is considered to be 
inefficient if it is inefficient for both the relay and the source nodes, and it is considered to be efficient otherwise. 
The following conclusions can be drawn for the scenario of network optimization. 

First, it can be shown that the optimal power allocation is efficient in Subcase I-1 and generally inefficient in 
Subcase I-2. Specifically, the optimal power allocation of the relay is always efficient in Subcase I-1 while the 
optimal power allocation of the source nodes can be either efficient or inefficient. In contrast, the optimal power 
allocation of the relay is always inefficient in Subcase I-2 while the optimal power allocation of the source nodes 
is also inefficient in general. For Case II, the optimal power allocation is efficient in Subcases II-1, II-2, and II-3 
and generally inefficient in Subcase II-4. Specifically, the optimal power allocation of the source nodes is efficient 
in Subcases II-1, II-2, and II-3 and generally inefficient in Subcase II-4 while the optimal power allocation of the 
relay is efficient in Subcase II-1 and inefficient in Subcases II-2, II-3, and II-4. 

Second, the optimal power allocation in Subcase I-1 achieves $R_{r1}(\lambda^0) + R_{r2}(\lambda^0)$. In this subcase, the source 
nodes minimize their power consumption while achieving the maximum sum-rate and in general they do not use up 
all their available power at optimality. Unlike Subcase I-1, both source nodes may use up their available power in 
Subcase I-2 while the achieved sum-rate is smaller than $R_{r1}(\lambda^0) + R_{r2}(\lambda^0)$. Similarly, the optimal power allocation 
in Subcases II-1, II-2, and II-3 achieves $R^{ma}(\mathbf{D}^0)$ while the relay does not necessarily use up its available power. In 
contrast, the optimal power allocation in Subcase II-4 consumes all the available power of the relay while the 
achieved sum-rate is smaller than $R^{ma}(\mathbf{D}^0)$. Therefore, it can be seen that for Subcase I-1 and Subcases II-1, II-2, 
and II-3, in which the optimal power allocation is efficient, either the maximum possible sum-rate of the MA phase 
or that of the BC phase can be achieved at optimality. Moreover, the source nodes and the relay generally do not 
both use up their available power. In Subcases I-2 and II-4, in which the optimal power allocation is inefficient, 
the achieved sum-rate is however smaller than either the maximum possible sum-rate of the MA phase or that of 
the BC phase, while it is possible that all nodes use up their available power. 
Fig. 1: Illustration of the algorithm in Table I 

Third, it can be shown for Case I that the difference between $\max_i\{1/\mu_i^0\}$ and $\min_i\{1/\mu_i^0\}$ increases in general 
as the subcase changes from Subcase I-1 to Subcase I-2. A similar result can be observed in Case II. As the subcase 
changes from Subcase II-1, via Subcases II-2 and II-3, to Subcase II-4, the difference between $\max_i\{1/\mu_i^0\}$ and 
$\min_i\{1/\mu_i^0\}$ increases. 

Last, from the definitions of $\mu_i^0, \forall i$, it can be seen that a large difference between $\max_i\{1/\mu_i^0\}$ and $\min_i\{1/\mu_i^0\}$ can 
be, and most likely is, a result of asymmetry in the number of antennas, available power, and/or channel statistics 
at the two source nodes. It will also be shown in detail later in the simulations that such asymmetry can increase 
the occurrence of Subcases I-2 and II-4. In contrast, if the two source nodes have the same number of antennas, same 
available power and same channel matrices, then $1/\mu_1^0 = 1/\mu_2^0 > 1/\mu_{ma}^0$. As a result, only Subcase I-1 and Subcase 
II-1 are possible, in which the optimal power allocation is efficient. Combining this fact with the observations in 
the above three paragraphs, it can be seen that the asymmetry in the number of antennas, available power, and/or 
channel statistics at the two source nodes can lead to a degradation in the power allocation efficiency for the 
considered scenario of network optimization. As efficiency reveals the cost of coordination between the relay and 
source nodes required to achieve optimality over the two phases in the network optimization scenario, it can be 
seen that such cost is low in the case of source node symmetry and high otherwise. 
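
The efficiency metric used in this subsection can be illustrated numerically. The sketch below is a simplified illustration under stated assumptions: it replaces the MIMO phase by parallel scalar channels with hypothetical normalized `gains`, measures rates in bits, and calls an allocation efficient if no other allocation with the same total power achieves a larger sum-rate, which for parallel channels is checked against standard waterfilling.

```python
import numpy as np


def waterfilling_max_rate(gains, total_power):
    """Maximum sum-rate over parallel channels for a given total power."""
    gains = np.asarray(gains, dtype=float)
    inv = np.sort(1.0 / gains)                 # ascending: strongest channel first
    # Try to keep k channels active, from all of them down to one.
    for k in range(len(inv), 0, -1):
        level = (total_power + inv[:k].sum()) / k
        if level > inv[k - 1]:                 # weakest active channel gets power
            break
    powers = np.maximum(level - 1.0 / gains, 0.0)
    return float(np.sum(np.log2(1.0 + gains * powers)))


def is_efficient(gains, powers, tol=1e-9):
    """True if `powers` attains the best sum-rate for its own total power."""
    gains = np.asarray(gains, dtype=float)
    powers = np.asarray(powers, dtype=float)
    achieved = float(np.sum(np.log2(1.0 + gains * powers)))
    best = waterfilling_max_rate(gains, float(powers.sum()))
    return achieved >= best - tol


example_gains = [2.0, 1.0, 0.1]                # hypothetical example values
waterfilled = is_efficient(example_gains, [1.0, 0.5, 0.0])
uniform = is_efficient(example_gains, [0.5, 0.5, 0.5])
```

Both example allocations spend the same total power, but only the waterfilled one is efficient in the above sense; the uniform one wastes power on the weakest channel.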

IV. Simulations 

In this section, we provide simulation examples for some results presented earlier and demonstrate the proposed 
algorithm for network optimization in Table I. The general setup is as follows. The elements of the channels $\mathbf{H}_{ir}$ and 
$\mathbf{H}_{ri}, \forall i$ are generated from complex Gaussian distribution with zero mean and unit covariance. The noise powers 
are set to 1. The rates $R^{ma}(\mathbf{D})$, $R_{ri}(\mathbf{B}_i)$, and $R_{ir}(\mathbf{D}_i)$ are briefly denoted as $R^{ma}$, $R_{ri}$ and $R_{ir}$, 
respectively, in all figures. 

Example 1: The process of finding the optimal solution for network optimization Subcase I-2 using the proposed 
algorithm in Table I. The specific setup for this example is as follows. The number of antennas $n_1$, $n_2$, and $n_r$ 
are set to be 6, 4 and 8, respectively. Power limits for the source nodes are $P_1^{max} = 2$, $P_2^{max} = 2.5$. The relay's 
power limit is set to $P_r^{max} = 3$. Since the optimality of the solution derived using the algorithm has been proved 
analytically by Theorem 3, we focus on demonstrating the iterative process and the convergence of the algorithm. 
Fig. 1a shows instantaneous $R^{tw}(\mathbf{B},\mathbf{D})$, $R^{ma}(\mathbf{D})$ and $R(\mathbf{B},\mathbf{D})$ versus the number of iterations. From the figure, it 
can be seen that the above three rates converge very fast. Fig. 1b shows the instantaneous $R_{ir}(\mathbf{D}_i)$, $R_{ri}(\lambda_i), \forall i$ and 
the power consumption of the source nodes 1 and 2, denoted as $P_1$ and $P_2$, respectively. Two observations can be 
drawn from Fig. 1b. First, $R_{r2}(\lambda_2) < R_{1r}(\mathbf{D}_1)$ and $R_{r1}(\lambda_1) = R_{2r}(\mathbf{D}_2)$ in the optimal solution since the sum-rate 
is bounded by $R^{ma}(\mathbf{D}) < R_{1r}(\mathbf{D}_1) + R_{2r}(\mathbf{D}_2)$. Second, both source nodes use all available power in the optimal 
solution. The latter observation verifies the conclusion that for Case I the optimal power allocation in Subcase I-2 
is inefficient for using possibly more power and achieving less sum-rate. 

(a) Percentage of the increase in sum-rate at optimality (b) Percentage of the decrease in power consumption at optimality 

Fig. 2: Improvements as compared to relay optimization 

Example 2: Comparison with relay optimization in Part I. The specific setup for this example is as follows. The 
number of antennas at the relay, i.e., $n_r$, is set to be 5. The power limit of the relay, i.e., $P_r^{max}$, is set to be 3. The 
total number of antennas at both source nodes is fixed so that $n_1 + n_2 = 5$. The total available power at both source 
nodes is also fixed so that $P_1^{max} + P_2^{max} = 2$. Given the above total number of antennas and total available power 
at the source nodes, both the relay optimization and the network optimization problems are solved for different $n_1$, 
$n_2$, $P_1^{max}$, and $P_2^{max}$ for 100 channel realizations. The percentage of the increase in the average sum-rate and the 
percentage of the decrease in the average power consumption at optimality of the network optimization problem 
compared to those at optimality of the relay optimization problem are plotted in Figs. 2a and 2b, respectively. 
These percentages are shown versus the difference between the number of antennas and the difference between 
the power limits at the source nodes. From these two figures, it can be seen that although the optimal solution 
of the network optimization problem on average consumes much less power than that of the relay optimization 
problem, it still achieves larger sum-rate. Moreover, it can also be seen that the improvements, in either sum-rate or 
power consumption of the optimal solution of the network optimization problem as compared to that of the relay 
optimization problem, become more obvious when there is more asymmetry in the system. This is because the 
source nodes and the relay can jointly optimize their power allocations and therefore cope with (to some extent) 
the negative effect of the asymmetry in the system in the network optimization scenario. In contrast, the relay 
optimization scenario does not have such capability to combat the negative effect of asymmetry. 

Example 3: The effect of asymmetry in the scenario of network optimization. First, we solve the network 
optimization problem for different $P_1^{max}$ and $P_2^{max}$ given that $P_r^{max}$ is fixed. The number of antennas of the 
relay is set to 8 and the number of antennas of both source nodes is set to 4. For each combination of $P_1^{max}$ and 
$P_2^{max}$, we use 200 channel realizations and solve the resulting 200 network optimization problems. The number 
of times that Subcases I-2 and II-4 appear are plotted in Fig. 3. In this figure, the points in the upper surface 
correspond to the counts of Subcase I-2 while the points in the lower surface correspond to the counts of Subcase 
II-4. From Fig. 3, it can be seen that in general the count of either Subcase I-2 or Subcase II-4 is the smallest 
when $P_1^{max} = P_2^{max}$. Moreover, for any given $P_1^{max}$ or $P_2^{max}$, the largest count of either Subcase I-2 or Subcase 
II-4 mostly happens where the difference between $P_1^{max}$ and $P_2^{max}$ is the largest.$^1$ The above two observations 
are accurate for most of the times in Fig. 3, which shows that the asymmetry of $P_i^{max}$ leads to the rise of the 
occurrence of Subcases I-2 and II-4. 

Next we demonstrate the effect of asymmetry in the number of antennas at the source nodes. The number of 
antennas of the relay is still 8 and $P_r^{max}$ is still 4. However, the number of antennas of source nodes 1 and 2 are 
first set to 4 and 6 and then 6 both, respectively. The network optimization problem is solved for different $P_1^{max}$ and 
$P_2^{max}$, and the sum of the counts of Subcases I-2 and II-4 in 200 channel realizations is plotted in Fig. 4 for each 
combination of $P_1^{max}$ and $P_2^{max}$. From Fig. 4a it can be seen that the sum of the counts of Subcases I-2 and II-4 
substantially increases when $n_1 = 4$ and $n_2 = 6$ as compared to the sum of the counts in Fig. 3 on most of the 
points. However, as shown in Fig. 4b, when $n_1 = n_2 = 6$, the sum of the counts of Subcases I-2 and II-4 drops 
to the same level as the sum of the counts in Fig. 3. Therefore, it can be seen that asymmetry in the number of 
antennas at the source nodes leads to larger chance of Subcases I-2 and II-4. 

$^1$Note, however, that subcases are also determined by the ratio of the number of antennas at the relay to the number of antennas at the source 
nodes, the ratio of $P_r^{max}$ to $P_i^{max}, \forall i$, the channel realizations and other factors, instead of only by $P_i^{max}, \forall i$. 

(a) Sum of the counts of Subcases I-2 and II-4 versus $P_1^{max}$ and $P_2^{max}$, $n_1 = 4$, $n_2 = 6$ (b) Sum of the counts of Subcases I-2 and II-4 versus $P_1^{max}$ and $P_2^{max}$, $n_1 = n_2 = 6$ 

Fig. 4: Illustration of the effect of asymmetry in the number of antennas at the source nodes. 

(a) Sum of the counts of Subcases I-2 and II-4 versus $v_1$ and $v_2$ (without assuming channel reciprocity) (b) Sum of the counts of Subcases I-2 and II-4 versus $v_1$ and $v_2$ (assuming channel reciprocity) 

Fig. 5: Illustration of the effect of asymmetry in channel statistics. 

Lastly, we show the effect of asymmetry in channel statistics. Instead of generating the real and imaginary parts 
of each element of $\mathbf{H}_{ir}, \forall i$ from Gaussian distributions with zero mean and unit variance, here we use Gaussian 
distribution with zero mean and variance $v_i$ to generate the real and imaginary parts of each element of $\mathbf{H}_{ir}, \forall i$. For 
each combination of $v_1$ and $v_2$, we use 200 channel realizations and solve the resulting 200 network optimization 
problems. The number of antennas at the relay is set to 6 and the number of antennas at both source nodes is set 
to 4. The power limits are $P_r^{max} = 5$ and $P_i^{max} = 3, \forall i$. The sum of the counts of Subcases I-2 and II-4 is plotted 
in Fig. 5 versus $v_1$ and $v_2$. Fig. 5a corresponds to the case without assuming channel reciprocity, in which the real 
and imaginary parts of each element of $\mathbf{H}_{ri}, \forall i$ are generated from Gaussian distributions with zero mean and unit 
variance. Fig. 5b corresponds to the case of reciprocal channels, i.e., $\mathbf{H}_{ri} = \mathbf{H}_{ir}^T, \forall i$ where $(\cdot)^T$ represents transpose. 
It can be seen from both Fig. 5a and Fig. 5b that the sum of the counts of Subcases I-2 and II-4 tends to increase 
when the difference between $v_1$ and $v_2$ becomes larger. Therefore, Fig. 5 clearly shows that the asymmetry in the 
channel statistics also leads to larger chance of Subcases I-2 and II-4. 
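
The channel generation described above can be sketched as follows. This is a minimal sketch under stated assumptions rather than the simulation code behind the figures: the function name and its arguments are hypothetical, the source-to-relay channel of source $i$ is taken to be an $n_r \times n_i$ matrix whose real and imaginary parts have variance $v_i$, and the relay-to-source channel is either its transpose (reciprocal case) or an independent draw with unit variance.

```python
import numpy as np


def generate_channels(n_relay, n_sources, variances, reciprocal, rng):
    """Draw one realization of source-to-relay and relay-to-source channels.

    n_sources: antenna counts of the source nodes, e.g. (4, 4).
    variances: per-source variance v_i of the real and imaginary parts.
    All names here are illustrative assumptions, not the paper's notation.
    """
    h_ir, h_ri = [], []
    for n_i, v_i in zip(n_sources, variances):
        std = np.sqrt(v_i)
        shape = (n_relay, n_i)
        h = rng.normal(0.0, std, shape) + 1j * rng.normal(0.0, std, shape)
        h_ir.append(h)
        if reciprocal:
            h_ri.append(h.T)                   # reciprocal channel: plain transpose
        else:
            back = (n_i, n_relay)
            g = rng.normal(0.0, 1.0, back) + 1j * rng.normal(0.0, 1.0, back)
            h_ri.append(g)                     # independent unit-variance draw
    return h_ir, h_ri


rng = np.random.default_rng(0)
h_ir, h_ri = generate_channels(6, (4, 4), (1.0, 3.0), reciprocal=True, rng=rng)
h_ir_nr, h_ri_nr = generate_channels(6, (4, 4), (1.0, 3.0), reciprocal=False, rng=rng)
```

Repeating such a draw for each of the 200 realizations and each pair of variances gives the inputs over which the subcase counts would be accumulated.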

V. Conclusion 

In Part II of this two-part paper, we have solved the problem of sum-rate maximization using minimum 
transmission power for MIMO DF TWR in the scenario of network optimization. For finding the optimal solution, we 
study the original problem in two cases each of which has several subcases. It has been shown that for all except 
two subcases, the original problem can be simplified into corresponding convex optimization problems. For the 
remaining two subcases, we have found the properties that the optimal solution must satisfy and have proposed the 
algorithm to find the optimal solution based on these properties. We have shown that the optimal power allocation 
in these two subcases is inefficient in the sense that it always consumes all the available power of the relay (and 
sometimes all the available power of the source nodes as well) yet cannot achieve the maximum sum-rate of either 
the MA or BC phase. We have also shown that the asymmetry in the number of antennas, power limits, and channel 
statistics leads to a higher probability of the above-mentioned two subcases. Combining with Part I of this work, 
we have provided a complete and detailed study of sum-rate maximization using minimum power consumption for 
MIMO DF TWR. 

VI. Appendix 

A. Proof of Lemma 1 

Proof for claim 1: Given D as defined in the lemma, it follows that i?'"''(D) = Rj^{T>j). From the definitions 
(l9ali-(|9c]l, it can be seen that i?™'»-(D) > Rjr(Dj) = Rjr(Dj) if l//ima(D) > Therefore, it is necessai-y that 

l//^ma(D) < 

Proof for claim 2: First, note that i?'"*(D) is a continuous and strictly increasing function of t in [0, 1]. Second, 
based on the definition (|9c] l. it follows that _R'"''(D) is a strictly increasing function of 1/ /Xma(D) when l//i,„a(D) > 
min{l/a,(fc),Vi,Vfc}, or equivalently, R"'^{T>) > 0. Since Tr{Di} > and Tr{D2} > 0, we have ff°''(D) > 
for all t G [0, 1]. Thus, given the fact that l//^ina(D)< I/Mj when t = and that l//ima(D) = l/Mma(D) > 
when t — 1, it can be seen that there exists t G [0, 1) such that l//ii„a(D) = I/Mj when t — t. Using Lemma 1 in 
Part I of this two-part paper L2J, i.e., l/^ma<uiax{l//xi, 1/^2}, it can be seen thatl/^i(Di) > l//inia(D) — 
when t = t. ■ 

B. Proof of Lemma 2 

Consider the first part of constraint (ITTi . i.e., Xj — jii > /ii„a if < Aj. Using Lemma 2 in Part I fT\, it can 
be seen that Ai,Vi satisfy Ai = A2 if min{l//ii} > l//ima at optimality. Therefore, we have min{l//^i} < l//^ma 
given that Ai ^ A2. Using the same lemma and the constraint (|13at . it can be further concluded that < l//^ma 
at optimality given that A; < \j. Otherwise, the constraint (I13bl i cannot be satisfied. Therefore, > l/Mma 

according to Lemma 1 in Part I |2|. Due to the constraint (ll3aK we must have 1/Aj < 1//^^ at optimality. Moreover, 
from Lemma 2 in Part I and the assumption that Ai < Aj, it can be seen that 1/Aj < is not optimal. Therefore, 
1/Aj = if Ai < \j. Following the same approach, the second part, i.e., Aj = /i; > /Xma if Mi > Mma can be 
proved similarly. 

C. Proof of Theorem 1 

Recall the definitions of /^i, /i2, and /ima in (l9a]l-(|9c|l. Considering the constraints (I16b) - (ll6dt in the problem 
( fT6] l. it can be seen that at optimality we must have /i*^^^ < A°, /i{ < A'^, and ^2 ^ Otherwise, the above 
mentioned constraints cannot be satisfied. We will prove Theorem 1 by contradiction. 

Assume that /i^^^^ 7^ A" at optimality, then 1.1*^^^ < A*^ according to the above paragraph. Using Lemma 1 in Part I 
of this two-part paper [|2], i.e, l//ima < max{l//ii, I/1J.2}, and given that fi\ < A° and < A°, there are only two 
possible cases as follows: a) max{l/ ^i^, l/fJ.2} > l/^ma > I/M2} ^ 1/A° and b) max{l///J, l//^2} ^ 

min{l//i5;, l//i2} > l//^ma > l/A*^. Assume without loss of generality that max{l//i{, l/zij} = 1//^* and 
min{l//i|, I//I2} = 1/M2- i'^ i^ Case a), then we have > l/^ma > I/M2 — l/^*^- Use Lemma 1 (of 

Part II) with = fD^ and = D2. As proved in Lemma 1, there exists t G [0,1) such that /^i(tD{) > 
l//ima([iD{,D^]) = Since > 1/A°, we have Aii(iDl) > l/Mma([tD{, D^]) = V/iJ > l/A^, which 

indicates that D = [tD|, D^] also satisfies (fT6bb-(fT6e]i while Tr{iD^}+Tr{D5} < Tr{D^}+Tr{D5}. It contradicts 
the fact that D* = [DJ, Dj] is the optimal solution to the problem (fTSI l. Therefore, Case a) is impossible. If it is Case 
b), there exist two following possible subcases: subcase b-1) there exists i G {1,2} such that l//i,iia(D) = 1/A" 
and l/^i(Di) > l//^,na(D) where D = [Di, D2] with = i^D* and Dj = D* for some ti e [0, 1) and subcase 
b-2) there does not exist U e [0, 1) such that l/^ma(6) = 1/A° and 1/^^(6^) > l/^ma(D) where D = [61,62] 
with Di = tiD* and Dj ~ D* for either i = 1 or i = 2. In subcase b-1), it can be seen that D satisfies ( I16bl )-( ll6el ) 
while Tr{t,D*} + Tr{D*} < Tr{D5;} + Tr{D^}. It contradicts the fact that D* = [DJ, DJ] is the optimal solution 
to the problem ( fTSI l. Therefore, subcase b-1) is impossible. If it is subcase b-2), it indicates that with ti G [0, 1), 
for either i = 1 or i = 2, such that l//ima(6) = 1/A°, we have 1/Aii(6,) = l//i,(ti6^) < l//ima(6) = 1/A°. 
As a result, there exists t'^ G {U, 1) such that l/fii{t'J)*) = 1/A" and l//i,na(D') > l/A^ where D' = [D;,Dy 
with DJ = <JD* and D^ = D*. Note that l//^,na(D') > 1/A° because if l/Aii(D';) = 1/A° and l/^„ia(D') = 
l/A'^ then it is subcase b-1) instead of subcase b-2). Recalling that l//ij(D*) > l//ima(D*) > l//ima(D')' we 
have l//ij(D*) > l//iijia(D') > l//ii(D,J) = I/Aq. It indicates that by changing D* in the optimal solution to 
D'; = <iD* (and thus using less power than Tr{DJ} + Tr{D2} while satisfying ( |16bt -( fT6el )), subcase b-2) changes 
to Case a). As it is proved that Case a) is impossible at optimality, so it is subcase b-2). 

Therefore, it is proved that the assumption jj.*^^^ ^ A" must lead to either of two cases while both of them are 
impossible at optimality. Thus, it is impossible that fi^^^ 7^ A°. As a result, we must have n*^^ = A'^. This completes 
the proof. ■ 
D. Proof of Theorem 2 

Proof of property 1. First we show that l//ima < 1/A". Since the maximum Rj^{Y)j), as the objective function of 
the problem ( fTTI l. cannot achieve Rn{\^) in Subcase 1-2, it can be seen that < 1/A" whenever l/z^ma > 1/A" 
and > 1/A*'. As a resuh, any D such that l//ima > 1/A" is not optimal. The reason is that in such a case 
the optimal relay power allocation requires l/A,: = < l/A'' according to Lemma 2 and such relay power 

allocation leads to a BC phase sum-rate ^_Ri;(A;) which is less than i?ii(A") + i?r2(A") according to Lemma 2 
in Part I Since l/^ma > 1/A° implies that i?™''(D) > ^ri(A°) + ^r2(A°), it can be seen that the consti-aint 
(I12bl i is not satisfied and therefore such strategies cannot be optimal. Next we show that min{l//i^} < l/nl^^. 
Assuming that l//i,*ia < it leads to l//i,*ia < l/A*' given that the problem ( fT6] l is infeasible. Moreover, 

it also leads to the result that A,* = /i*jjj,V/. However, it is not difficult to see that i?™^(D), ^i?r;(A/) and 

I 

eventually i?*^(B, D) can be increased in this case through appropriately increasing 1 / /ima, which is feasible since 
VM?na > l/A*' > l//^*-,a, and at least one of l/A^ and l/A^, which is also feasible since l/A* = l/^ma < l/A", 
given that l//^i*ia < ^inll/A^;"} ^^'^ l/^ma < l/A". It contradicts the assumption that D* and B* are the optimal 
solution. Therefore, 1/ > min{l//i^}. 

Proof of property 2. Given the fact that < l/A", the problem boils down to finding the maximum 

l//^ina such that the corresponding rate _R™'*(D) can also be achieved by the BC phase sum-rate ^i?ii(A;) 
subject to the first constraint in (fTOt and the constraint that min{l/A;} = minjl//^;} as stated in Lemma 2. 
Since the maximum ^i?ri(A;) cannot achieve R™^{D) subject to the above-mentioned constraints as long as 

l//^ina > l/A", the problem is equivalent to finding the maximum achievable ^ R,:i{\i) subject to the constraints 

I 

that min{l/Ai} = min{l//i;} and that i?'"^(D) = J2Rrii>'i)- Since ff°''(D) can achieve up to i?'"^(D"), it is 

I I 

not difficult to see that the maximum achievable ^ A;) subject to the above-mentioned constraints demands 

I 

the relay to use full transmission power P^^^^^. 

Proof of property 3. Define the index such that min{l//i;} = 1//^,-. Recall from the proof of property 1 
that l/^tma < l/A". As a result, _R™''(D*) is not the maximum i?'"'*(D) that can be achieved, which implies that 
there exists such that i?™''(D'') > i?'°''(D*) and ^,;-r(D--) > ^i-r(D*_) - S where 5 is a positive number. 
Define Z = Hii.DiH5^^ + H2rD2H^j,. It can be seen that i?™''(D) is a concave function of Z. If D* is not the 
optimal solution to the problem of maximizing min{l//x;} subject to the constraints in (fTsT l. there exists D^^ such 
that i?™^(Dq) > i?™^(D*) and ^i-,(D^_) > ^,-,(D*_). Then, for any < a < 1, these exists 0^= such that 
= aT)f + (1 - Q;)Df,VZ. Moreover, for any a such that 

R,^,(D*_) - R,-,(Dl_: 



<a<l, (28) 



i?,-,(D^)-i?,-,(D^_; 

it can be shown that _Rj-i.(DJ_) > _Rj-i.(D*_) using the fact that _R;i(Di) is concave with respect to D;,VL 
Denoting Z^ = Hi,T>fllf^ + H2rT>^ng and Z" = HirD|H?, + H2rD|Hg,, it can be shown that D^,V? lead to 
Z= = aZ<5 + (l-a)Z" and therefore i?'"^(D=) > ai?"^^(Dq) + (l-a)i?'"^(D'^) > i?'"^(D*). Therefore, if D* does 
not maximize Ri-^^Di-) subject to the constraints in ( fTsT l. then /ij-t.(Dj-) and i?™^(D) can be simultaneously 
increased. The fact that i?j-r(Dj-) can be increased means that mm{l/fj,i} can be increased, which implies that 
the BC phase sum-rate J2 RiiiM) can be increased according to Lemma 2 in Part I [|2| subject to the constraint 
that min{l/A;} = min{l//i;} as stated in Lemma 2. Given this resuh, the fact that i?'"''(D) can be simultaneously 
increased suggests that i?*^(B,D) can be increased. This contradicts the fact that D* is the optimal solution that 
maximizes i?*^(B,D) with D* subject to the related constraints. Therefore, D* must maximize min{l//ii} subject 
to (HI. 

Proof of property 4. It can be seen that the maximum achievable subject to the constraints 

E'°''(D) > , Tr(Di ) < PI'''''', VI (29) 

is a non-increasing function of R°^K If < l/z^j. according to property 1 of this theorem and the fact that 
1 / A*ma < niax{l//i(}, it can be shown that > 1/ /i*„a- Since l//i°„a > ™d the maximum achievable 
is a non-increasing function of R°^\ there exists D such that > l/Aj ^"d l/fij ~ l//ima > l/^ma- Using 
1//Ltma < max{l//i;} from Lemma 1 in Part I Q, it can be shown that 1/ fli > ^/jj-j = l/Ama at this point. Since 
the maximum Rjr{T>j) cannot achieve fJriCA") in the problem ( fTTI i. it can be seen that l/jlj = l//ima < 1/A". 
In such a case, the optimal strategy of the relay is to use 1/A; = l//ima < 1/A°,VZ, which does not consume the 
full power of the relay. Therefore, according to property 2 of this theorem, the i?**(B,D) that can be achieved, 
specifically R™^{T>), in the case that — l//ima is not the maximum that _R*^(B,D) can achieve. Moreover, 
since l//ima > l/A^ma' it can be seen that ff"''(D*) < ff"^(D). As a result, i?**(B*, D*) i?"'^(D*) < ff"^(D). 
Using the above-proved fact that i?'"^(D) is not the maximum that i?'" (B, D) can achieve, this result obtained under 
the assumption 1//^* < contradicts the assumption that B* and D* are optimal. Therefore, the assumption 

that 1/^* < l/^j must be invahd. ■ 

E. Proof of Theorem 3 

The proof follows the same route as the proof of Theorem 2. 

Proof of property 1. As there exists no $\lambda_j$ which satisfies the constraints in (27), it can be seen that $\sum_l R_{rl}(\lambda_l)$ 
cannot achieve $R^{ma}(\mathbf{D}^0)$ subject to the constraint $\lambda_i = \mu_j$, which is necessary as stated in Lemma 2. Therefore, 
it is necessary that $1/\mu_{ma}^* < 1/\mu_{ma}^0$. Given that $1/\mu_{ma}^* < 1/\mu_{ma}^0$, it can be shown that the resulting $R^{tw}(\mathbf{B},\mathbf{D})$ is 
not maximized if $1/\mu_{ma}^* \leq \min_l\{1/\mu_l^*\}$. Therefore, it is necessary that $1/\mu_{ma}^* > \min_l\{1/\mu_l^*\}$. 

Proof of properties 2-3 from Section VI-D can be applied here after we substitute all $\lambda^0$ therein by $\mu_{ma}^0$. Proof 
of property 4 of Theorem 2 can be directly applied here. Thus, property 2 of Theorem 3 is proved. ■ 